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Abstract 

We describe a new algorithm for the decoding of Reed Solomon codes. This algo- 
rithm was originally described in [12]. While the algorithm presented in this article 
is the same, the presentation is somewhat different. In particular we derive more 
precise bounds on the performance of the algorithm and show the following: For an 
[n, kti, (1 — K)n\ q Reed Solomon code, the algorithm in [12] corrects (c(k) — o(l))n errors 
in polynomial time, where 



( \ 1 1 Pk h 
e(K) = 1 k where p K 



2 1 1 

k + 4 ~ 2 



We also present the following two (hopefully) useful lower bounds on c(k): 



Vk G [0, 1] e(K) > 1 - \2k + — + - > 1 - V2k. 

1 Introduction 

For integers n, k and q such that a finite held of size q exists, the Reed Solomon codes are 
[n,k,d = n — k] codes over the alphabet F = GF(q) (the Galois held of order q). The code 
may be obtained by letting a message m = m . . . rajt-i denote the coefficients of a degree 
k — 1 polynomial M(x) = J2jZo in-jZ 1 and letting the encoding of m be C(m) = c\ . . . c n 
where c 8 - = M(xi) where x\,...,x n are n distinct elements of F. (It is standard to pick 
n = q — 1, though not necessary to prove the distance property or for the algorithms we 
describe). 

In this paper we consider the following bounded distance decoding problem: 




Input: Integers n, k and e; and n pairs {(x 8 -, y;)}™ =1 , £;,y 8 ' G i* 1 with x 8 's being distinct 1 . 
Goal : Find all polynomials p\ , . . . , p m of degree k — 1 such that for every j G [ra] , pj (x 8 ) ^ y 8 - 
for at most e values of i G [ra] . 

As can be seen easily, this problem captures the bounded distance decoding problem for Reed 
Solomon codes. There is a rich history of work associated with this problem. The classical 
work of Berlekamp-Massey (cf. [2, 9]), corrects upto [(d — l)/2j errors. Sidelnikov [11] and 
Dumer [4] have constructed algorithms which correct up to [(d — l)/2j + clogn errors for 
any constant c [4, 11]. We give an algorithm that improves over these results when k/n is 
sufficiently small (i.e., less than 1/3). Our algorithm is motivated by an algorithm of Welch 
and Berlekamp [14, 3] which corrects [(d — l)/2j + 1 errors. In this article we describe the 
algorithm of Welch and Berlekamp and use it motivate our decoding algorithm. We also 
describe a crucial intermediate step from [1] which forms the basis of our algorithm. Our 
main result is summarized below and proven in Lemma 9. 

Theorem 1 For every e, k The bounded distance decoding problem with parameters n, k = 
nn and e = e(n)n can be solved in polynomial time provided 

e[K) < 1 k where p n = 

1 + p K 2 



2 Decoding with univariate rational functions [14, 3] 

The idea of Welch and Berlekamp [14, 3] can be informally described as follows: They 
describe how a "rational function" in x can be used to "explain" the "data" {(x 8 -, y;)}™ =1 . 
They then show how to efficiently find a rational function that explains the data and then 
show how to use this rational function to find the (unique) polynomial p which disagrees 
with the data in at most e places. We now describe the algorithms more formally. 

Lemma 2 ([14, 3]) Given n points {(x 8 -, y;)}™ =1 such that there exists a degree k — 1 poly- 
nomial p such that yi ^ p(x{) for at most e values of i, the following hold: 

1. There exist polynomials N(x) and D(x) where deg(iV) < k — 1 + e, D is monic with 
deg(D) = e, such that for every i G [n], N(xi) = yiD(xi). (Informally, we could say 
that yi equals the rational function -^(xi).) 

2. Such a pair of polynomials (iV, D) can be found in polynomial time. 

3. For any such pair of polynomials (N, D), -p(-) = p(-), provided e < |(n — k). 

Remark: As a consequence p can be found in polynomial time. We just divide the polyno- 
mials N and D to obtain p. 



1 Yoy our algorithm from [12] we can replace this condition with the weaker one that (x{, yi)'s are distinct. 



Proof: Let E(x) be an "error-locator" polynomial, i.e., E(x{) = if (but not necessarily 
only if) yi ^ p(xi). Notice that E has degree at most e and w.l.o.g. we can allow it to 
be monic and have degree exactly e. We now notice that the polynomials D(-) = E(-) and 
N(-) = p(-)E(-) satisfy the condition N(x{) = yiD(xi) for every i. This proves Part 1. 

To see Part 2, notice that if we let the unknowns {rij}^t. ~ denote the coefficients of N(-) 
and let the unknowns {dj} e - Q denote the coefficients of -D(-), then the constraints N(x{) = 
HiD(xi) give linear constraints on the unknowns {n 3 } } s and {o?j}'s. Also the constraint 
deg(D) = e. We do so by setting the linear constraint d e = 1. Thus a solution pair (N,D) 
can be found efficiently by solving a linear system. 2 

Finally notice that there exists a pair (A, D) such that -^ (•) = p(-), as described in the proof 
of Part 1. Thus to prove Part 3, it suffices to prove that for any pair of solutions (Ai, D\) 
and (N 2 ,D 2 ), satisfying Nj(xi) = yiDj(xi) every i £ [n] and j £ {1,2}, jy(-) = jf~(/)- To 
see this, hrst observe that for every i £ [n], we have N\{xi)D 2 {xi)yi = N 2 (xi)Di(xi)yi). 
Futhermore, we can cancel y 8 - from both sides (even if y 8 - = 0, since in such a case we have 
N\(xi) = N 2 (xi) = 0. This yields that for every i } (NiD 2 )(xi) = (N 2 Di)(xi). But then both 
sides describe polynomials of degree 2e + k — 1 and two sides agree on n points. By the 
condition on n, we have that n > 2e + k — 1 and thus the polynomials on the two sides are 
identical, i.e., (NiD 2 )(-) = (N 2 Di)(-). This yields the desired conclusion immediately. I 



3 Decoding with algebraic curves in the plane [1, 12] 

A slightly different interpretation of the Welch-Berlekamp algorithm is that it finds an alge- 
braic curve in the plane which "explains" the "data". To be precise, the algorithm finds a 
function Q(x } y) } where Q(x } y) = D(x)y — N(x), such that for every i £ [n], Q(xi 7 yi) = 0. 
While this particular scenario attempts to explain the data by a "linear" polynomial in y 
- there is no need to restrict the analysis to this situation. Ar et al. [1] considered such a 
generalization. They consider the case where the data is "explained" some algebraic curve 
Q of low degree in y (but not necessarily a linear polynomial in y). They show that in such 
a case, if there exists a polynomial p such that y 8 - = p(x{) for many values of i (compared 
to deg x (Q) and deg y (Q)) then p can be reconstructed easily. To describe their analysis, the 
following definition is useful. 

Definition 3 For positive integers w x and w y , the (w X} w y ) -weighted degree of a bivariate 
polynomial Q(x } y) = J2i,j Qij^U 1 ?s defined to be max{iw x + jw y \qi rj ^ 0}. 

Lemma 4 ([1]) Given n points {(x 8 -, y 8 ')}™ =1 s.t. there exists a bivariate polynomial Q sat- 
isfying: 

The (l,k — 1) weighted degree of Q is at most D, Q ^ and\/i £ [n], Q(xi 7 yi) = 0. (1) 

Then the following hold: 



2 Actually Berlekamp and Welch [14, 3] give a much more efficient solution for this task, but we will not 
describe their solution here. 



1. A polynomial Q satisfying equation (1) can be found in polynomial time. 

2. If p is a polynomial in x of degree at most k — 1 such that yi ^ p(x{) for at most 
e < n — D values of i then for any polynomial Q satisfying (1), it is the case that 
(y — p{x)) divides Q(x } y). 

Remark: As a consequence, a small set of polynomials which includes p can be found in 
polynomial time. We simply factor the polynomial Q obtained in Part 1 above and output 
all p such that y — p(x) divides Q. The polynomial Q can be factored in time polynomial in 
its degree using the algorithm of Kaltofen [7] or Grigoriev [6] (see also Kaltofen [8]). 

Proof: For Part 1, we observe as in the proof of Lemma 2 that for any i } the condition 
Q(xi,yi) = J^ji^ji^y 1 = is a linear constraint on the unknowns q 3 \. Thus a solution 
satisfying (1) can be found in polynomial time, if one exists. 

For Part 2, we consider the polynomial g(x) = Q(x } p(x)). Notice that since the (1,A; — 1)- 
weighted degree of Q is D } the degree of g is also at most D. Notice further that if for 
some i £ [n], y 8 - = p(xi) } then g(x{) = Q(xi 7 p(xi)) = Q(xi 7 yi) = 0. Thus g is zero on 
n — e > D points. Thus g is identically 0. Now consider the polynomial Q x (y) = Q(x } y) (i.e., 
the polynomial Q viewed as a polynomial in y with coefficients from the ring of polynomials 
in x). By the division theorem for polynomials we have that if Q x (() = 0, then y — ( divides 
Q x (y)- Applying this fact to Q x (p( x )) = Q( x ,p( x )) = g( x ) = we find that y — p(x) divides 

Q(x,y). I 

Notice the close correspondence between Lemmas 2 and 4. The main difference between the 
two is Part 1 of Lemma 2, the analog of which is missing in Lemma 4. As a result Lemma 4 
works conditionally, i.e., only when the "data" is explained by a low-degree algebraic curve. 
Our solution complements this by providing a "triviality" result. Namely, we observe that 
every set of points lies on a low degree algebraic curve; low enough to make Lemma 4 always 
useful! We illustrate this observation by a simple example 

Example 5 For any set of points {^ 8 ,y 8 '}™ =1 there exists a non-zero polynomial Q with 
deg x (Q) } deg y (Q)< \^\. 

Proof: Consider the linear system Q(x,y) = J2j=o J2\=o (Iji^iVi = 0. For every i this 
forms a homogenous linear system. The system has n constraints on (\\/n + l]) 2 > (y/n + 
l) 2 > n + 1 unknowns. By just counting the number of unknowns we know that the system 
has a non-zero solution (any homogenous linear system with more variables than unknowns 
has a non-zero solution). I 

Notice that unlike in Part 1 of Lemma 2, we didn't even use the fact that the data has some 
agreement with a degree k — 1 polynomial. Thus the example above is a "triviality" result 
- it holds for any set of n points. Yet the trivial result can be useful as seen next: 



Example 6 For any set of n distinct points {(x 8 -, y 8 ')}™ =1 the following hold: 

1. There exists a bivariate polynomial Q satisfying equation (1) with D = k^fn. 

2. Such a polynomial Q can be found in polynomial time. 

3. // p is a polynomial in x of degree at most k — 1 such that yi ^ p(x{) for at most 
e < n — k \y/n] values of i then for any polynomial Q satisfying (1) with D = k^fn, it 
is the case that (y — p(x)) divides Q(x } y). 



Proof: Part 1 follows from Example 5. Parts 2 and 3 follow from Parts 1 and 2 of Lemma 4 
with D = k\y/n\. I 



Notice that Example 6 is already correcting more errors than guaranteed by Lemma 2 for 
some values of n and k (in particular when k grows as o(y /r n). By some fine tuning we can 
actually get to the point where this algorithm always does at least as well as the Welch- 
Berlekamp algorithm and for rate less than 1/3 it starts correcting significantly more error. 
The fine tuning is performed by minimizing the weighted (1, k — 1) degree of the polynomial 
Q which is used to explain the data. 

Lemma 7 Given n and k, let t be the smallest positive integer satisfying: 



(t-l) 



t-\ 
k-1. 



+ 1 



(k-1) 



t- 1 
k-1. 



+ 1 



t- 1 
k-1 



> n. 



0. 



Then, for any set of n distinct points {(x 8 -, y 8 ')}™ =1 the following hold: 

1. There exists a bivariate polynomial Q satisfying: 

The (1, A; — 1) weighted degree of Q is < t, Q ^ and \/i £ [n], Q(xi 7 yi 

2. A polynomial Q satisfying (2) can be found in polynomial time. 

3. If p is a polynomial in x of degree at most k — 1 such that yi ^ p(x{) for at most e < n — t 
values of i then for any polynomial Q satisfying (2), it is the case that (y — p(x)) divides 
Q(x,y). 

Remark: Again as a consequence we can find a list of all polynomials p that agree with the 
points {(xi } yi)} inn-e points in time polynomial in n. 



Proof: A polynomial Q of (l,k — l)-weighted degree t is allowed to have a non-zero co- 
efficient q rj \ if j + (k — 1)1 < t. Counting the number of such coefficients we find that Q 
has 



L-^rr-J 

£(t-(fc-i)z + i 



) = (*-!) 



t-l 
k-1 
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+ 1 



t- 1 
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coefficients. Since this number is strictly greater than n we must have a non-zero solution. 
This proves Part 1. Parts 2 and 3 follow directly from Lemma 4. I 



4 Some bounds 



Let e(k } n) denote the maximum number of errors corrected by the algorithm of Lemma 7 

on an [n,k,d\ q Reed Solomon code. Let k = k/n denote the rate of a code and let e(/c) = 
linin^oo e(nn } n)/n denote the asymptotic error-correcting rate of the algorithm. In this 
section we derive the exact form of e(k } n) and e(/c). We also describe some lower bounds to 
both quantities. 



Lemma 8 

e(k } n) = n—(k—l)r kj . 



2n - (k - l)r k:n (r k:n + 1) 
2{r k , n + 1) 
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Proof: Let t be the smallest integer satisfying 
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t- 1 
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> n. 



By Lemma 7 e(&, n) = n — t — 1. Let t — 1 = r{k — 1) + s, where r and s are integers and 
< 5 < A; — 1. Then LfzrJ = r ' anc ^ r anc ^ s sa ti s fy 



(k- l)(r + l)r 
(r(A; - 1) + s)(r + 1) - ^ A ; > n. 



(3) 



We first determine r based on the above. The constraints < s < k — 1 translate into the 
constraints: 

r(k-l)(r + l)-r(k-l)(r + l)/2 < n + 1 and (r + l)(k-l)(r + 1) -r(k-l)(r + l)/2 > n + 1. 

The above, in turn, simplify to 



2 n+1 1 1 , 2 n+1 

r < \ \ ' and r > \ \ ' -1-1. 

~ \j k-1 4 2 \j k-1 

The requirement that r be an integer now allows us to determine r precisely: 



/2(n + l) 1 1 



1 k-1 4 2 
The value s can now be determined from (3) and we get: 

~2(re + l) -r(r + l)(k- 1) 
2(r + l) 

The lemma follows by setting r k , a = r and t = r{k — 1) + s + 1 and using the fact that 
e(k, n) = n — t — 1. I 

The following lemma simplifies some of the expressions above by examining the error- 
correction rate e(n) as a function of k. 
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Figure 1: The new error correction bound c(k) plotted against the rate k. Also described are 
the error correction radius of the code and the distance of the code. 



Lemma 9 



c(k) = 1 
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e(/s) can 6e lower bounded as follows: 
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(4) 



VkG[0,1], e(«) > 1-W2/C + — + k/2 
> 1 -V2k 



(5) 
(6) 



Remark: The bound (4) is described pictorially in Figure 1. The lower bounds (5) and (6) 
are compared against the bound (4) in Figure 2. 



Proof: (4) follows from Lemma 8 by letting p K = linin^oo r Ka;a and simplifying the quantity 
lim^oo e(nn } n)/n. 

To prove (5), we hrst show that 
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Figure 2: Lower bounds on the error-correction rate of our algorithm. 



where p' K = W-| + | — | . Notice that by the dehnition of p' K , we have 



p' K <P«<p' K + l and - = (^ + 2 )(^ + l). 



(7) follows from: 



1+Pk 1+Pk 2 2 — 

> o 



(Since p K > p' K : 



(1 + ^)(1 + Pk) 2 

(i+pJ(i+p k ) ^ (IT^fe+Z) (Sincef = (l + p'J(2 + p'J) 



where the last inequality follows from the properties of p' K . 
(5) now follows from the following equalities: 
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Finally inequality (6) is derived easily as follows: 
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5 Conclusions 

The algorithm as presented here does generalize naturally to dealing with erasures, for the 
simple reason that we made no assumptions about the x^s (we didn't depend on them being 
in some specific order etc.). However the bounds can not be represented elegantly in any 
form - so we do not attempt to do so here. We only point out that since the Welch-Berlekamp 
algorithm is a special case of ours, we do at least as well. 

There are a number of questions on decoding Reed Solomon codes that remain open. For 
instance, is there an algorithm that can decode from more errors (than (d — l)/2) when 
k = k/n > 1/3? A nice target would be a decoding algorithm that works for e(/c) < 1 — t/k. 
In this case we know (cf. [5, 10]) that the number of codewords within a distance of e(n)n 
is bounded by a polynomial in n. One does expect that the problem will become harder as 
e(/c) —>■ 1 — k. It would be interesting to see if the problem becomes NP-hard as e(/c) —>■ 1 — k. 
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